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Abstract 

. with binary response. The target is to find the design that maximizes the power of the relevant 



We consider the classical problem of selecting the best of two treatments in clinical trials 



test. Many papers use a normal approximation to the power function and claim that Neyman 
allocation that assigns subjects to treatment groups according to the ratio of the responses' 
standard deviations, should be used. As the standard deviations are unknown, an adaptive 
design is often recommended. The asymptotic justification of this approach is arguable, since 
it uses the normal approximation in tails where the error in the approximation is larger than 
the estimated quantity. We consider two different approaches for optimality of designs that are 
related to Pitman and Bahadur definitions of relative efficiency of tests. We prove that the 
optimal allocation according to the Pitman criterion is the balanced allocation and that the 
optimal allocation according to the Bahadur approach depends on the unknown parameters. 
Exact calculations reveal that the optimal allocation according to Bahadur is often close to 
the balanced design, and the powers of both are comparable to the Neyman allocation for 
small sample sizes and are generally better for large experiments. Our findings have important 
implications to the design of experiments, as the balanced design is proved to be optimal or 



*We thank Amir Dembo for helpful comments and in particular for deriving the exact expression for v* in (4). 



close to optimal and the need for the complications involved in following an adaptive design for 
the purpose of increasing the power of tests is therefore questionable. 



KEYWORDS: Neyman allocation, adaptive design, asymptotic power. Normal approximation. 
Pitman efficiency, Bahadur efficiency, large deviations. 

1 Introduction 

We consider the problem of optimal allocation of individuals to two treatment groups with the goal 
of selecting the better treatment. The problem arises frequently in clinical trials, which usually 
have several possibly conflicting purposes such as minimizing the number of subjects treated in 
the inferior treatment or maximizing the power of the relevant test. The current paper focuses on 
the latter goal and aims at answering the flrst question appearing in the chapter "Fundamental 
questions of response-adaptive randomization" of the book by Hu and Rosenberger (2006): what 
allocation maximizes power? It appears that the accepted answer to that question is the Neyman 
allocation, see references below. However, it is shown, both theoretically and by exact calculations, 
that the balanced allocation, that is, assigning an equal number of subjects to each treatment, is 
optimal or close to optimal. Unlike Neyman allocation, the balanced allocation does not depend 
on unknown parameters, and therefore no adaptive estimation is required. Adaptive designs are 
complex by nature, and our results question the need for the conducting such designs when the goal 
is to maximize power. 

Let A and B be two treatments with unknown probabilities of success, pa and pb- A trial with 
n subjects is planned with NA{n) and NB{n) subjects assigned to treatment A and B, respectively, 
where NA{n) + Nsin) = n. For each subject, a binary response, success or failure, is observed. 
Let Un '■= NA{n)/n be the proportion of subjects assigned to treatment A. We sometimes refer to 
Un as the allocation. The design problem considered here is of choosing the optimal allocation z/„ 
that maximizes the power of the standard test of the hypothesis Pa = Pb versus one or two-sided 
alternatives. For given n, pA, and pb, the optimal allocation fraction z/„ can be found by a flnite 
search over all possible allocations. Here we study this problem for large n instead, and look for the 
asymptotically optimal allocation fraction u*. 

Let Yi{m) ~ Bin{m,pi) be the number of successes if m patients are assigned to treatment i 
(z = A,B). Let also pa = PA{NA{n)) = ^^^^^ and pb = PB{NB{n)) = be the estimators 
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of Pa and ps', note that pA and pB depend on n and the allocation sequence Un, however they are sup- 
pressed for notational convenience. The Neyman allocation rule, z/ = a/pa(i pa) — minimizes 
the variance of the estimator pA{n) —pB{n) for the difference of probabilities (e.g., Melfi et. al (2001)). 
However, it is not clear that the Neyman allocation also maximizes the power of the Wald test for 
equality of proportions, as appears to be widely believed (e.g., Brittain and Schlesselman (1982); 
Rosenberger et. al (2001); Hu and Rosenberger (2003); Bandyopadhyay and Bhattacharya (2006); 
Hu et. al (2006); Hu and Rosenberger (2006); Tymofyeyev et. al (2007); Biswas et. al (2010); Zhu and Hu 
(2010); Chambaz and van der Laan (2011)). For example, when comparing the Neyman allocation 
to the balanced design, the latter authors claim that "resorting to the balanced treatment mechanism 
may be a very poor (inefficient) choice". Below we show that this claim is asymptotically incorrect. 
The standard Wald statistic for comparing pA and pb is 

W := {Pb - Pa] j ^/V{pA,pB,n,Un), 

where VipA^PB^n, i/n) = + if37T?r- the above papers, the power is often calculated by 

approximating the distribution of the squared Wald statistic by a non-central chi-square distribution; 
the Neyman allocation then maximizes the non-centrality parameter. The argument is based on the 
following normal approximation: 

D ^u/ ^ ^ _ D ,Pb-Pa-{pb-Pa)^ zi^a ■ ^yV{pA,pB,n,Un) " {pb " Pa) 

^PA,PbV^ > ^l-a) - ^PA,PB I > 



\/V{pA,PB,n,Un) \/V{pA,PB,n,Un) 

^ l^c^f - VV{PA,pB,n,Un) - {pb-Pa) \ i_c^ ( _ Pb-Pa 

\ V^{pA,PB,n,Un) J V ° y/V{pA,PB,n,iyn) 

where $ is the standard normal distribution function, and Zi_a = $^^(1 — a). The Normal approx- 



imation is valid only if {pb — Pa)/ ^/V{pa-,Pb, n, Vn) = 0(1), i.e., when pb — Pa ^ n ^/"^ . However, 



for fixed Pb — Pa > 0, the term {pb — Pa) / ^/ViPA, Pb, Vn) is of order a/ti, and the expression 



$ P-B PA jg Qf asymptotic order that is smaller than the precision of the normal ap- 

\ \/y(PA,PB,ri,Vn) J 

proximation, and therefore its use is problematic. Thus, the claim that Neyman allocation maximizes 
the power seems theoretically questionable. 

For asymptotic power comparisons and evaluation of the relative asymptotic efficiency of certain 
tests, two different criteria are often used, related to the notions of Pitman and Bahadur efficiency 
(see e.g., van der Vaart (1998), Chapter 14). In our context, the Pitman approach looks at sequences 
of probabilities p\ > p\ that tend to a common limit at a suitable rate. The Bahadur approach 
considers fixed probabilities pa and pb and approximates the power using large deviations theory. 



We show in the next sections that the optimal allocation corresponding to the Pitman approach 
is always u* = 0.5 while the Bahadur optimal allocation depends on pA and pb and can be calculated 
in a way described below. Interestingly, computation of the Bahadur criterion for different values 
of Pa and pb reveals that the optimal allocation is often close to 0.5. In disagreement with some 
of the papers mentioned above, these results cast doubts on the asymptotic justification of adaptive 
designs and show that, at best, such designs can lead to a practically negligible improvement over a 
non-sequential balanced design in terms of power. 

The paper is organized as follows: Sections 2 and 3 describe the approaches of Pitman and 
Bahadur for maximizing the power, and find the corresponding optimal rules. In Section 4, the 
optimal allocation according to the Bahadur criterion is calculated for different parameters and 
compared to the Neyman allocation. Exact calculations are performed for a wide range of parameters. 
A related problem that arises in dose findings experiments is discussed in Section 5; the Neyman 
allocation is shown to be optimal or close to optimal in this case. Section 6 extends the Bahadur 
approach to general (rather than binary) responses; concluding remarks are given in Section 7. All 
proofs are given in the Appendix. 



2 The Pitman Approach 

Pitman relative efficiency provides an asymptotic comparison of two families of tests applied to 
a sequence of statistical problems. Here we utilize the same idea to compare different allocation 
fractions. 

Consider a sequence of statistical problems indexed by k, where Pa = P+ p'h = P + foi' 
5a < Sb and < p < 1. Let = nk{SA,SB,P,o:, f3,{h'n}) be the minimal number of observations 
required for a one-sided Wald test at significance level a and power at least /3 (for /3 > a) at the 
point p\iP^Bi "where the observations are allocated to the two groups according to the fraction Set 
nfc = oo if no finite number of observations satisfies these requirements. The next theorem implies 
that the balanced allocation is asymptotically optimal. 

Theorem 1. Fix 5a < Sb, a < P and < p < 1. Let {un} be a any sequence of allocations and let 
{i^n} be another sequence of allocations satisfying z>„ — )■ 1/2. Then 

77 X a r~ IN > 1- 

fc^oo nk{dA,dB,P,a,l3,{Un\) 
The theorem follows readily from the following lemma, proved in the Appendix. 
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Lemma 1. /• // m« v for < < 1 then 



k^oo k \ {6b - Sa) y z^(1 - ' 




//. If Un ^ or Un ^ 1 then 



lim — = oo 

fc— >CXD k 



III. For any sequence of allocations {vn} 



k^oo k \ (Sb — 5a) 

Theorem 1 holds also when considering a two-sided test. The theorem shows that the balanced 
design is asymptotically optimal in the Pitman sense, and as a consequence, one cannot gain efficiency 
(in the above sense) by considering sequential adaptive designs. The key point here is that when p\ 
and p'^ converge to the same value p, the variances of their estimators converge to the same value 
and hence the limiting Neyman allocation is 1/2 regardless of p. This phenomenon is not observed 
in problems concerning the Normal distribution or similar cases where the variance is not a function 
of the mean. 

It can be argued that rather than considering sequences of statistical problems as above, one 
should optimize for fixed pa and pb- The next section deals with this case. 



3 The Bahadur Approach 

In this section, large deviations theory is used to approximate the power of the Wald test for fixed pa 
and Pb- This power increases exponentially to one with at a rate that depends on the allocation 
fraction u. Recall that pA and pB depend on both n and an allocation z/„. The aim is to find the 
optimal limiting allocation fraction u* for which the rate is maximized. We prove the following large 
deviations result: 

Theorem 2. Define 

H{t, v) := V log(l -PA+ PAc'^n + log(l -PB+ PBe^'^^'-"^), 

and let giy) := inf(>o Hit, u) . 
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/. One sided test: assume that Pb > Pa and — ?■ v, where < < 1, then for any constant 
K>0 

lim-logi 1-P ( , PB-PA >K]\=gM. (2) 

II. Two sided test: assume that Pb Pa o.n-d Vn ^, where < < 1, then for any constant 
K>0 

limllog|l-p M^«:^^'\ >A-)U,M. (3) 

" n [ \V{pA,PB,n,Un) J J 

///. Ifun^O or 1 then (2) and (3) hold with g{0) = g{l) = 0. 

Note that ps —Pa is not an average of n i.i.d random variables and, therefore, Theorem 2 does not 
follow directly from the Cramer-Chernoff theorem (see e.g., van der Vaart (1998), p. 205), however, 
its proof uses similar ideas. 

For each fixed n, let z/„ = Vn {pa,Pb, K) be the allocation that maximizes the power of the 
one sided test for a total sample size of n subjects, i.e, 

*(i) ID ( Pb-Pa ^ 

vj' > = argmax,^ ^ri n-i,i^ , > K 

\^V{pA,PB,n,u^) 

similarly, Un^^^ = u^'^\pa,Pb^K) is the optimal allocation of the two-sided test. 

Let V* = i'*{paiPb) '■= argminj^5f(i^). It is easy to prove directly that g is strictly convex, and 
the minimum is attained uniquely. More generally, it is readily shown by differentiation that if 
M(t) = Ee^-^ is a moment generating function, then vM{t/v) is a convex function of v. Theorem 2 
suggests the use of v* as the design fraction. However, for a given n, the optimal allocation, is not 
necessarily i^*, but the fraction z/n^^^ or z/n''^^ for the one or two-sided test, respectively. Therefore, it 
is reasonable to use v* as the design fraction only if u^^'^ — )■ v* for i = 1,2. The following theorem 
shows that this is indeed the case. 

Theorem 3. /. If ps > Pa then for any K>0, Un" V*. 
II. If Pb Pa then for any > , z/^*'^^ — )■ v* . 

Remark 1. Another formulation of these results, for the one-sided case, say, is the following: assume 
that Pb > Pa then for any sequence z/„ and constant K > 

\immf-\og\l - P ( ^=fM=E£== > K ] }>g{u*] 

" ^ [ \VV{PA,PB,n,Un) 

and the infimum is attained for sequences z/„ — )■ i/*. 



Table 1: The optimal Bahadur allocation for different parameters compared to Neyman allocation. 



PA 


Pb 


u* 


Neyman allocation 


0.5 


0.8 


0.518 


0.556 


0.5 


0.65 


0.504 


0.512 


fi 


75 


510 


531 


0.7 


0.75 


0.505 


0.514 


0.7 


0.85 


0.521 


0.562 


0.7 


0.9 


0.535 


0.604 


0.85 


0.95 


0.541 


0.621 


0.5 


0.9 


0.542 


0.625 



Remark 2. When pA < Pb represent success probabilities of two treatments, and treatment B is 
selected as better ifpsin) > pA{n), then the expression in (2) with K = approximates the probability 
of incorrect selection. 



4 Numerical Illustration 

Some tedious calculations show that 

.• = .og| "-'"^'"^ l/logj^^'^'"^!}. (4) 
'^\(l-pB)l0g(i5ff)// npA(i-PB)/ ^' 

Table 1 compares the asymptotic Bahadur optimal allocation and the Neyman allocation for several 
pairs {paiPb)- The table and further systematic numerical calculations indicate that the Bahadur 
allocation is closer to 0.5 than the Neyman allocation and that it is quite close to 0.5 unless pa 
and Pb are very far apart (e.g., pa = 0.5, = 0.9). In the latter case, the power is close to 1 for 
any reasonable allocation. These findings justify the use of the balanced allocation and question the 
utility of more complicated adaptive sequential designs. 

We preformed some exact calculations to compare the Bahadur allocation, the balanced allocation 
and the Neyman allocation. Figure 1 compares the difference between the maximal possible power 
for sample size 200 and 500, and the power under the different allocation methods for the two-sided 
test with a = 0.05 and for different parameters. The power is calculated exactly using R. While for 
moderate sample size (n = 200) no allocation is better for all the parameters we considered, for large 
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sample size {n = 500), Bahadur is better for almost all parameters, and the balanced allocation is 
usually better than Neyman; however, the differences in power are relatively small. 

n=200 n=500 



C\J 




0.50 0.55 0.60 0.65 0.70 0.75 0.50 0.55 0.60 0.65 0.70 0.75 

p_A p A 

(a) Moderate sample size {n = 200) (b) Large sample size {n = 500) 

Figure 1: The differences between the maximal power of the two-sided test with critical value K = 1.96, attainable 
(by Vn^'^'^) and the Neyman allocation (black), the balanced allocation (red) and the Bahadur allocation (blue) for 
PA — 0.5, . . . , 0.75, pb = PA + 0.2; for moderate (n = 200) and large (n = 500) sample size. 

Figure 2 shows the power of the two-sided test for different allocations where pA = 0.7, ps = 0.9; 
it is clearly seen that the Neyman allocation, which is widely recommended for maximizing the 
power, is far from being optimal. Thus, the exact calculations presented in this section support the 
theoretical results: the balanced allocation is usually better than the Neyman allocation for large 
samples, and they are indistinguishable for small samples. In all cases, the differences are quite 
negligible, and therefore the balanced allocation should be preferred due to its simplicity. 

5 A Related Problem 

Dose finding studies are conducted as part of phase I clinical trials in order to find the maximal 
tolerated dose (MTD) among a finite, usually very small, number of potential doses. The MTD is 
defined as the dose with the closest probability of toxic reaction to a pre-specified probability po. 
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CD 
CD 

ci 



0.40 0.45 0.50 0.55 0.60 0.65 

V 

Fi gure 2: The power of Wald tests with critical value K = 1.96 for different allocations where pA = 0-7, Pb = 0.9 
and n = 500. The smooth line is a parabolic fit to the function. The vertical lines show the balanced allocation 
(ly — 0.5), the Bahadur allocation {ly = 0.5349374) and the Ncyman allocation {i/ ~ 0.6043561). 

Recently, we showed that under certain natural assumptions, in order to estimate the desired dose 
consistently, one can consider experiments that eventually concentrate on two doses (Azriel et. al 
(2010)). Thus, asymptotically, the allocation problem in MTD studies reduces to the problem of 
finding which of two probabilities of toxic reaction pa < Pb (corresponding to the doses (Ia < ds) is 
closer to po- 

Let Pa and pb denote the proportions of toxic reactions in doses (Ia and (Ib based on a total 
sample size of n individuals and an allocation For large n, pA < Pb, and a natural estimator for 
the MTD is MTD = cLa if {Pa + Pb)/2 > p^ and MTD = ds otherwise. Similar to the problems 
discussed in previous sections, an optimal design is an allocation rule of n z/„ and n{l — Vn) individuals 
to doses dA and ds, respectively, such that P{MTD = d^) = P{{pA +Pb)/'^ > Po) is maximized if 
dA is indeed the MTD. 

For the current problem, the Pitman approach is translated to a comparison of designs under 
sequences of parameters p^, and p^ such that \{pa +p%)/2 — Pq\ = K/y/k, for fixed < < oo, 
and Pa Pa, Pb ~^ Pb- Let < u < 1 and let = nk{j)\,p%,p'^, a, be the minimal number 

of observations required such that the probability of incorrect estimation of the MTD is smaller than 
a for the given parameters when the allocation for dose dA '^^ n- Vn- As in Lemma 1, it can be shown 
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Table 2: Comparison of the Bahadur and Pitman (here Neyman) allocation rules for different parameters. 



Pa 


Pb 


Po 


Bahadur 


Pitman 


n 1 


u.o 




n 490 


U.Oc/U 


0.2 


0.35 


0.3 


0.460 


0.456 


0.22 


0.33 


0.3 


0.471 


0.468 


0.25 


0.35 


0.33 


0.479 


0.476 


0.2 


0.4 


0.33 


0.455 


0.449 


0.1 


0.4 


0.3 


0.400 


0.380 



that if z/„ — J- 1/ then 



fc-5>oo A; I 2K ) 1 z/ 



Thus, the asymptotically optimal design uses Neyman allocation, u = \^a{^ pa) — 

minimizes the limit of Uk/k. Unlike the previous problem, now p\ and do not converge to the 
same value under the Pitman approach as defined here, and hence the Neyman allocation does not 
reduce to the balanced design. 

For the case of fixed pa, Pb, and po, assume that ps is nearer than pA to po, and consider the 
problem of minimizing the probability of selecting (Ia- The following theorem, analogous to Theorems 
2 and 3, gives the asymptotic optimal allocation rule in the current setting. 

Theorem 4. Let z/„ = NA{n)/n, < u < 1, and assume that z/„ — )■ u, then, 

lim -logP[{pA + PB}/2>po] = tp{u), 

n— >-oo 77, 

where iplu) = inft{z/log(l — Pa + Pa^^^") + {1 — u) log(l — Pb + PBe'^^^^""^) — 2pot}. 

Moreover, let v* = argmin?/;(z/), and let be the value of the allocation minimizing P[{pa + 
Pb}/'2 > Po] for a given n. Then, ^ u* . 

We calculated v* for several values oipA and pb and found that v* is often close to the Neyman 
allocation, see Table 2. Both criteria, Bahadur and Pitman, yield quite similar results in this problem. 
Allocating subjects according to the Neyman or Bahadur improves the probability of correct MTD 
estimation compared to the balanced allocation for very large samples, as the optimal allocations 
according to Bahadur or Pitman are far from 0.5. Calculations not presented here, show that for 
practical sample sizes for the MTD problem, all three methods differ in a negligible way. 
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6 A General Response 



In previous sections, we dealt with the very important, though specific, case of a binary response. In 
this section, we consider the more general case where the response of an individual treated in group 
A (B) follows a distribution Fa {Fb) having moment generating function MA{t) {MB{t)), and find 
the optimal allocation according to the Bahadur approach. Let YA{m) {YB{m)) denote the average 
of m responses of subjects having treatment A {B). Assume that the treatment with the largest 
mean response is declared better at the end of the experiment. The following theorem, which can 
be proved in a similar way as Theorems 2 and 3, provides the Bahadur optimal allocation rule for 
correct selection: 

Theorem 5. Assume that treatment B is better, i.e, J xFsidx) > J xF^ldx), and that Un — )■ ly. 
Then, 

lim -logP{F4(nz/„) >?ij(n(l-z/„))} = Mi^), 

n— >oo n 

where 

h{u) = inf[z/log{M^(t/z/)} + (1 - u)\og{MB[-t/{l - u)]}]. (5) 

Moreover, let v* = a.Tgmin^ h{u) , and z/* be the value of the allocation minimizing 
P{YA{nUr,) > Ysinil - u^))} . Then K^v*. 

When the responses in the two treatments are normally distributed, then the Bahadur allocation 
agrees with the Neyman allocation. This can be easily verified by using the moment generating 
functions of Normal variables in (5). However, for other distributions, the allocations suggested by 
the Bahadur and the Neyman criteria may differ considerably. Table 3 compares the Bahadur and 
the Neyman allocations for different Poisson and Gamma distributions. The two rules clearly differ. 
As in the Binomial case, the Bahadur allocation is closer to 0.5 than to the Neyman allocation. 
Further study is required to determine if the improvement over the balanced allocation, in terms of 
power or probability of correct selection, is significant. Anyway, optimality of the Neyman allocation 
for non-normal distributions should be questioned, and may hold only under restrictive conditions. 

7 Conclusions 

We discussed asymptotic approximations of power and probability of correct selection in testing and 
selecting the best treatment, and in MTD finding, and related optimal allocation of subjects to 
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Table 3: The optimal allocation i^* for different distributions compared to the Neyman allocation. 



Fa 


Fb 


Bahadur allocation 


Neyman allocation 


Poisson(l) 


Poisson(2) 


0.471 


0.414 


Poisson(2) 


Poisson(3) 


0.483 


0.449 


Poisson(3) 


Poisson(4) 


0.488 


0.464 


Poisson(4) 


Poisson(5) 


0.491 


0.472 


Gamma(0.5,0.5) 


Gamma(0.5,0.6) 


0.515 


0.590 


Gamma(0.5,0.5) 


Gamma(0.5,0.7) 


0.528 


0.662 


Gamma(0.5,0.5) 


Gamma(0.5,0.8) 


0.539 


0.719 


Gamma(0.5,0.5) 


Gamma(0.5,0.9) 


0.549 


0.764 



treatments. 

Neyman allocation is optimal when the response is Normal, and it is asymptotically optimal 
in the Pitman sense, that is, for converging sequences of alternatives as described above. In the 
binary response selection problem in which and ps become closer, Neyman allocation reduces to 
a balanced allocation, independent of the parameters pa and pb- The Bahadur allocation for fixed 
Pa and ps turns out to be close to balanced, and therefore, by both criteria, our conclusion is that 
for the purpose of maximizing the power, adaptive allocation seems unwarranted, and the simpler, 
non-sequential balanced allocation should be preferred. However, when other criteria (e.g., ethical 
criteria) are of primary concern, as is often the case in clinical trials, the balanced design is not 
necessarily optimal and adaptive designs may be found beneficial. 

Our findings are partly in contrast with the literature that bases allocations on the noncentrality 
parameter appearing in a Normal or Chi-Square approximation (e.g., Rosenberger et. al (2001); 
Tymofyeyev et. al (2007)). These designs minimize or control the variance of the difference but need 
not be efficient in the sense of controlling or maximizing the power. 

A Appendix 

Proof of Lemma 1 part I. The proof, included here for completeness, uses arguments as in 
Theorem 14.19 in van der Vaart (1998) (p. 205), which is stated in terms of relative efficiency rather 
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than allocation. 

First note that limkUk = oo; otherwise, there exists a bounded subsequence of Uk on which the 
power converges to a value < a, since as A; — )■ oo we have p\—p% 0. This contradicts the definition 
of Uk and the assumption that a < (5. 

By the Berry-Esseen theorem we have 



since the third moment is bounded; a similar limit holds for p^. Here we use the notation p\ = 
PAi^nk^k) = Y^i^Unf. rik) / {h'rik TT'k) , whcre Y^{m) ~ Bin{m,p\) is the sum of m independent binary 
responses with probability p\. 
Now, if Ujik — )■ we have 



/ p(l^P) 

Since — !■ oo, the critical value for the level a one-sided Wald test is zi^a + o(l); then 



' p(i-p) / p(i^p) 



Also, 



''^'c' ^ni)nk a.s 



P(l-P) 



^-4-1, 



and since the limiting power is exactly /3 we have due to (6) 

{5B-5A){\ir^k v^} _ 

/ p(l-p) 

V Ki-^^) 

hence (1) holds. 

Proof of part II. We only prove the case Vn — 0, as — ?■ 1 is similar. If nz/„ is bounded, then the 
power converges to a and = oo for large k. 

Assume now that n z/„ — oo; by the Berry-Esseen theorem and Slutsky's Lemma we have 

^/n^Ap\ -{P+ ^)) ^ Ar(0,p(l - p)) and y/W^wipB - (P + ^)) ^ 0. 
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This implies that 
and by arguments as in the first part we have 

= ( s,-s, ) 

Because z/^^. — )■ 0, hm^ ^ = 00. 

Proof of part III. There exists a subsequence {k'} such that z/„,^, — t- v' for some v' and 

-\ 2 

. r^k rik' (zi_a-zi-p I p{l-p) 

hm mi — = hm — ' ' 



k^oo k k'^00 k' \ {Sb — Sa) V i^'i^ — v') 

where the second equahty follows by part I. If z/'(l — z/') = we interpret the limit as 00; since 
u'{l — u') <^ the third part of the lemma follows. □ 
Proof of Theorem 2 parts I and II. The proof follows known large deviations ideas; however, 
certain variations are needed for the present non-standard case. Notice that the probability in part 
I is larger than the probability of part II (for \/K). Therefore, it is enough to show that for any 
K >0 

limsup-logi 1 -P I >K]}<g{u), (7) 



and for any K > 



hminf i log (1 - P ( J^"" . . >K]}> g{u) 

n n { \V{pA,PB,n,Un) 



In fact, instead of the latter inequality we prove in the sequel a stronger result, namely 

liminf - logP I < Pa-Pb ^ j., \ ^ ^^^^^ 

" ^ V V^iPA,PB,n,u^) J 

for all K' > 0, which is also used for the case of = in part I, when K' = 00. 



For the upper bound (7), define S{n) := ^/V{pa-,Pb, J^n, n) ■ n; notice that S{n) is bounded. 
Hence, for any e > and for large enough n we have 

l-p( ^1II £==>k] =p(p^-p^>-J^Sin))<PipA-pB>-s). 

\V^{PA,PB,n,Un) J \ J 

Now, for any t > 0, 

P(pa -PB >-£:) = P(e ^^A(")/" iv^M/n > > e'"-^") < E[e^^^ ^ ^]e"*^ 
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by Markov's inequality. We can write the latter term as 

Since — )■ z/, and the inequality holds for all t > 0, 

limsup - logP ( > — ^S{n)] < ge{i^), 



where Qsiy) '■= inft>o{^^ + H(t, u)}. This is true for any e > 0, and by the continuity of Qeii^) in e 
we have for any K > 

lim sup - log P ( - > — ^S{n)) < g{u), 



which verifies (7). 

To prove (8), assume without loss of generality that pb > Pa] define 

Tn ■■= PAynVn) - PB[n[l - Vn)) = 7Z ^ • 

The log of the moment generating function of T„ is 

logE[e*^"] = nz/„log(l - PA + PAe"^) + n{l - i^n) log(l -pB +PBe"(i-'")) = nH(J-,Vn)- (9) 

Since -E[T„] = pa —Pb < 0, by (9) we have -j^HiQ, z/„) < 0. Also, i/(0, Un) = and z/.„) is strictly 
convex being the log of a moment generating function, up to a constant. Since P(T„ > 0) > it 
follows that H{t,Vn) — > oo as t — )■ oo and therefore, argminf>o t'n) ='■ t^o^ is a unique interior 
point and ^H{tl^\h'n) = 0. Let to be the minimizer of H{-^iy); we show that t^^^ — )■ to- If there 
is a subsequence {tg"*'^} that converges to ti < oo then H {t'^''\ Unf.) < //(to,^'„j.) (as tg"'"'' is the 
minimizer) implies H{ti, v) < H{tQ, v) and therefore ti = to as the minimizer is unique and finite. 
Define a new random variable Z„, which is the Cramer transform of T„ 

P[Zn = z) := e-"^("")e^*o"'"P(T„ = z). 

Now, 

\ VV{PA,PB,n,Un) J \ Vn ' 

= E[I{0 < Z„ < A5(^)}e-^'^*o"'"]e"f('^") > P(0 < Z„ < -^5(n))e"^W-"^^*^"'^e"^^('^"), 
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where the last inequahty holds since e " > e > e 2 y i--^™ _ follows that 



g[Vn) logP < ^ <K\< h 



n 



V^iPA,PB,n,Ur,) J n y/n ' 

Clearly, the second term on the right-hand side vanishes as n goes to infinity; for the first, we claim 
that y/nZn is asymptotically N{0, ^H{tl^\un)) and consequently P(0 < Zn < ■^S{n)) — C for 
some constant C > 0. Indeed, the log of the moment generating function of \fnZn is 



logE[e^^^i = -ng{v,:) + logE[e^"(^^+*o"'-)] = n{-E{tt\ ^n) + H{t^^^ + ^, 



(n) 



where the last equality follows from (9) and the identity g{yn) = H{tQ\ By Taylor expansion of 



H{-, Un) around tg"'' we obtain 



since the first derivative is 0, and therefore, 

logi?[e^v/^^"]^^^i/(*\..). 

We conclude that 

hmsup { g{u^) - -logP ( < -—^^^l£== <K]}<0, 

V V^yPA,PB,n,Un) 



hence. 



liminf-logP I < < A' 1 > g{iy) 

\/V{pA,PB,n,Pn) 



n 

and part 1 and 11 follow. 

Proof of part III. First note that (7) clearly holds with giy) = as log{l — P(-)} < 0, so it remains 
to prove (8) for g{u) = 0, that is, for any K > 

Hminf - logP(0 < Pa-Pb ^ ^ ^ ^ 

We only prove the case Vn — ?■ 0, as z/„ — > 1 is similar. If z/„ y^- 00 then is inconsistent and the 
limit is easily seen to be zero. Assume now that n z/„ — t- 00; since 

..... ^ PaO--Pa) , Pb{^-Pb) ^Pa{^-Pa) 
V{pA,PB,i^n,n) = \ > 



ni^n. nil — Un) 



we have 

/XTP-- ^P{^^PA-PB< J= ). 

y/V{pA,PB,'^n,n) y/nUn 
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Now, for e := MzM)^ 

P[0 <Pa-Pb < — , — ) > ^({0 <Pa-Pb < — , — 1 n {pb e [pb ^=,Pb)}) 

> P[Pb <Pa<Pb^ ^= )P(Pb e [Pb ^=,Pb))- 

Taking logs and limits in the above product, we have to consider two parts. For the first, we have 
by Lemma 2 below 



hm log P[pb <Pa<Pb^ ; ) = C. 



for some constant C; therefore, 



hm - log <Pa<Pb^ ; ) = 0. 

The limit of the log of the second part divided by n is 0, since 

P{PB e {pB - — > P{-e < ^/n{l-iJ^){PB-pB) < 0) ^ C" > 

by the CLT. □ 

Lemma 2. Let Vi, V2, ... be i.i.d with EVi < and moment generation function M{t), and let Xn he 
positive and uniformly hounded random variables that satisfy Xn K for a constant K > 0; then, 

limilogPfO <Vn< ^) = infllogMm}. 



Proof of Lemma 2. The lemma follows by the same argument as in van der Vaart (1998), p. 206 
(replacing e in that proof by where K is the bound of X„); see also the proof of parts I and II 
of Theorem 2, where a similar argument is used. □ 
Proof of Theorem 3. We will prove part I; the proof of Part II is similar. Consider the sequence 
of allocations v'^ = ^"'^ ^ ; Theorem 2 implies that 

Hm i log ( 1 - P f >k]\^ ,(.*). (10) 

Now, let z/ 7^ z/*, < < 1, be a limit of a certain subsequence {uk}, i.e., z/n[.^^ — )■ z/, and define 
e = {g{u) - g{v*))/2. By (10), there exists N such that for ?i > 

UoAl-pi^-^^^^M^£^>K\\<g[.n+e. 
" I V ^V{pA,pB,n,v'n) 
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Since Un^^^ — > i/ we have by Theorem 2, for large enough k 



' log 1 1 - P ( - '^"-''^^ - ^^^^^^ "--''^ > K > 9ii>) - e = gju*) + 



where g{0) = g{l) = 0. Hence, there exists Uk > N such that 



in contradiction to the optimality of i^n^^^; therefore the limit of every converging sub-sequence is u*. 
□ 

The proofs of Theorems 4 and 5 are omitted because they are very similar to the proofs 
of Theorems 2 and 3. 
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